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0.  Summary 

Let  X  and  Y  be  independent  random  variables,  with  continuous 
distributions  F  c  5  and  G  e  ,  on  the  sample  space  X  .  Let  to  be 
a  homeomorphism  from  3C  onto  itself  and  define 

H(to)  =  /  F(o)dG  . 

From  samples  of  X  and  Y  we  form  f  and  G  which  are  estimates  of  F  and 
G  ,  respectively,  and  define  an  estimate  of  H  ,  say  H  , 

H(co)  =  /  F  (to)dG 

for  to  e  £2  ,  a  class  of  homeomorphisms  linearly  ordered  by  H. 

Interpreting  <o(Y)  as  the  strain  under  use  to  and  X  as  the  strength 
(of  some  device),  then  H(to)  =  P[X  <  to(Y)]  represents  the  unreliability  and 
H(co)  is  an  estimate  of  it.  If  we  use  H  to  determine  a  use  to  ,  then  what  is 
the  probability  that  the  true  unreliability  H(oo)  is  too  large?  We  examine  this 
problem  under  the  assumptions  that  F  (F  *)  and  G(G  *)  are  distribution-free 
with  respect  to  %  and  respectively.  This  provides  an  answer  in 

some  cases  and  allows  one  to  obtain  stochastic  bounds  in  others. 

Sponsored  by  the  Mathematics  Research  Center,  United  States  Army  under 
Contract  No.:  DA-11-022--ORD-2059. 


1.  Genesis  of  the  Problem 


We  were  asked  to  consider  the  problem  of  determining  the  unreliability 
of  a  missile  fuel  tank  for  a  given  weight  of  liquid  hydrocarbon  fuel  to  be 

installed.  The  volume  of  the  tank,  the  specific  weight  of  a  batch  of  fuel  and 

•  • 

the  temperature  environment  were  stochastic  variables. 

In  the  temperature-pressure  range  for  this  specific  problem  considerations 
of  the  physical  and  chemical  properties  of  the  fuel  show  that  the  design  pressure 
is  exceeded  if  the  following  event  obtains:  [X  <  wg(Y)]  ,  where  X  is  the  random 
volume  of  the  tank,  w  is  the  weight  of  the  fuel  installed  and  g  is  a  known 
function  (determined  by  the  bulk  modulus  of  the  fuel,  the  temperature  variation 
of  range  and  the  design  pressure  of  the  tank)  of  Y  the  random  specific  weight 
of  the  fuel  batch. 

■  If  a  weight  w  of  fuel  is  installed,  the  design  unreliability  of  the  tank, 
i.  e. ,  the  probability  of  the  pressure  exceeding  the  design  specification,  is 
H(w)  =  P[X  <  wg(Y)].  If  the  distribution  of  X  was  F  and  that  of  Y  was  G, 
both  known,  then  we  could  express 

00 

H(w)  =  f  F(wg(x))dG(x)  . 

0 

This  equation  defines  the  function  H  and  hence 

(a)  for  a  given  weight  w  of  fuel  installed  we  can  determine  the 
design  unreliability  H(w)  , 

(b)  for  a. specified  design  unreliability  of  at  most  «  we  can  seek 
the  maximum  weight  w  for  which  we  have  H(w)  <  «  . 
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However,  in  practice  the  distributions  F  and  G  are  usually  not  known  and 
we  have  only  sample  values  of  tank  volumes  and  the  sample  values  of  the 

*  a 

specific  weights  of  fuel  batches ‘with  which„to  arrive  somehow  at  answers  which 
correspond  to  the  cases  (a)  and  (b). 

It  is  the  study  of  the  statistical  problems  for  the  situations  (a)  and  (b), 
within  the  more  inclusive  formulation  of  the’problem  that  we  propose,  which 
constitutes  the  subject  matter  of  this  note. 

2.  Introduction  and  Related  Results 

Let  (36,6.)  be  a  measurable  space  for  which  (3C,H  )  is  a  partially 
ordered  set,  i.  e. ,  the  relation  is  reflexive,  transitive  and  such  that  x  -<  y 
and  y -<  x  imply  x  =  y.  If  the  relation  is  measurable,  i.  e. ,  for  all  xe  3£  the 
set  {yc3C  :  y  -<  x}  is  in  G-  ,  then  a  distribution  can  be  defined  on  3£  for 
each  measure  P  on  (X  by  F(x)  =  P[X  -<  x]  ,  with  the  usual  interpretation  of 
X  as  the  identity  random  variable  (r.  v.). 

If  for  each  xeX  the  set  [X  =  x]  has  measure  zero,  then  the  distribution 
F  is  continuous.  Now  X  is  called  a  positive  sample  space  if  and  only  if 
(iff)  for  each  x  *<  y  ,  x  =£  y  we  have  the  set  [X  y  x]r^[X  -<  y]  has  positive 
P-measure. 

Let  X  and  Y  be  random  variables  taking  values  in  the  positive  sample 
space  X  with  continuous  distributions  F  and  G,  respectively.  (The 
generalization  to  Y  taking  values  in  a  space  different  from  3ft  will  be  seen 
to  be  immediate. )  Let  n  be  a  set  of  transformations  on  3fc  onto  itself.  For 
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each  w  «  p.we  have  the  probability  that  X  precedes  w(Y)  according  to 

•  • 

given  by  ■ 


(2.1) 


H(w)  =  P[X  ^  w(Y)]  =  /  Fw  dG 


where  we  make  the  convention  that  juxtaposition  of  functions 'refers  to  composition 

and  integrals  are  understood  to  be  over  the  entire  space  36.  ..  .  • 

•  •  •  •  * 

Let  us  assume  that  while  F  and  G  are  unknown  we  can  obtain" samples 

*  • 

of  X  and  Y  from  which  we  form,  respectively,  the  estimates  F  and  G  of 
the  distributions.  Thus  we  define  an  estimate  of  H,*  say  H  ,  by 


(2.2) 


H(<4  =  J  Fw  dG 


for  each  u  «  J2  for  which  the  integrals  exist. 

*  ,  » 

Some  questions  of  interest  are:  In  what  sense  is  H  a  gbod  estimate 

*  • 

•  *  * 
of  H-?  If  «  is  known,  what  is  the  distribution  of  H(u>)  ?  But  primarily  we  want 

^ 

the  maximum  transformation  u  (as  a  function  of  H)  for  which  for  each  F  f  If'  , 
•  •••.*' 

•  •  •  ,  •  • 

we  have  "  . 

•  .  * 

•  •  * 

P[H(w)  >  e]  <  a 


where  a  and  «  are  small  and  specified  in  advance.  That  is,  we  are  interested 
in  th.e  problem  of  using  H  Id  obtain  what  would  l?e  tolerance  limits  were  H  a 
probability  distribution  on  n. 

•  • 

.If  consists  of  a  single  point  w,  we  can  without  loss  of  generality 

. 

assume  it  is  "the  identity  transformation.  We  then  have  the  problem  of  estimating. 
•  •  . 

p  =  P[X  ■<  Y]  from  samples  of  X  and  Y. 

•  Suppose  that  36  is  the  real  line  with  the  usual  ordering  and  *5  = 
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is  the  class  of  continuous  distributions.  Then  it  is  well  known  that  the  empiric 
•  • 

distribution,  say  G  ,  is  the  unique  minimum  variance  unbiased  estimate  of  G  . 

•  • 

But ,  further,  p  defined  analogously  to  (2.2)  is  the  unique  minimum  variance 

*  • 

0  •  0 

unbiased  estimate  of  p  and  mnp  is  the  Mann-Whitney  statistic  where  m  and. 

•  •  • 

n  are  the  sample  sizes  used  in  computing  F  and.  G  . 

The  problem  analogous  to  ours,  i.e.,  involving  p  and  p,  has  been 

studied  and  bounds  for  the  corresponding  estimates  have  been  obtained  for  large 
•  • 

sample  sizes  under  the  assumption  that  one  of  the  distributions  is  known 
»  * 

*  • 

(Birnbaum -[1])  and  assuming  that  neither  of. the  distributions  is  known  (Birnbaum 

•  *■ 

*  •  m 

and  McCarty  [2]). 


3.  -Results 

Let  _<  be. a  partial  ordering  on  the  set  of  all  transformations  of 

•  •  .  * 

•  ,  *  • 
onto  itself  for  which  (J2,  <)  is  a  linearly  ordered  subset’  which  is  order  complete. 

•  • 

•  % 

That  is,  (n,  <)  is  a  partially  ordered  set  for  which  any  two  elements  are  comparable 

and  each  non-void  subset  of  J2  which  has  a  lower  bound  has  an  infimum. 

For  a  sample  of  independent,  r.  v. 's  each  with  the  same  distribution 

•  • 

Q-  I-  /^/ 

F  (  y  ,  the  estimate  F  (which  is  a  measurable  function  of  the  sample)  of  F 
is  ample  for  iff  the  random  function  F  F  1  has  the  same  probability 

law  for  every  F  € 

’  ,  • 

We  now  make  our  assumptions:  _  *  * 

1°  %  and  are  classes  cf  continuous  distributions  on  the 

partially  ordered  positive  sample,  space  (  3f  ,•£  ) 

2 °  '(n,  <)  is  a  linearly  ordered  complete  space  of  transformations  on 

such  that 
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(3.  1) 


3°  F  and  G  are 


o)i  <  o)2  implie°s  wi  (x)  -<  o>2(x)  for  all  xc  . 

G  are  anjple  estimates  for  F  (  and  G  «  &  , 


respectively. 

Since  there  always  exists  a  set  of  transformations  on  36  ,  say  r  , 

>6  •  7  7 

such  that  =  {F0y  :  y  e  r}  and  similarly  =  {G0X.  :  X.  «  A  }  where  T 

* 

and  A  each  contain  the  identity  function,  it  is  seen  in  this  representation 

*  ^ 

that  F  is  ample  iff  Fv.  (or  equivalently  iff  yy  )  is  distribution-free  with 
•  • 

respect  to  ^  .  We  shall  adopt  this  representation  wherever  convenient, 

’without  comment,  in  what  follows.  * 

* 

The  set 


(3.2) 


$  =.  {yo)X  XeA  ,  ye  F  ,  o)eS2  } 


is  a  set  of  transformations  on  ’  36  ’  which  is,  in  general,  partially  ordered  by 


<  .  -Let 


(3.  3) 


H0(«t>)  =  /  Fo«fr  dG0  , 


(5.4) 


%  r  —  \  —  1 

H0(4>)  =  J  Fy  <)>  dGX. 


be  two  functions,  the  second  random,  defined  for  each  cj>e$  for  which  the 
•  •  ^ 

integrals  exist.  (The  functions  F  and  G  may  both  contain  discontinuities 
and  these  canno't  be  made  to  coincide.) 

W6  now  define  for  6  an  element  of  the  range  of  fT 


(3.5) 


w.  =  inf  {uei2  :  H(u))  >  6} 
6 
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which  by  completeness  of  ft  is  a  r.  v.  Let  c  be  in  the  range  of*  H0  and  H. 
We  now  pick  the  unique  . 


(3.6) 


0  eft  such  that  Ho(0.)  =  e 


and  there  exists  a  unique 


(3.7) 


co  eft  such  that  H(oo  )  =  e  . 
e  e 


We  now  have 

•  • 

Theorem  1:  Let  1°  ,  2°  ,  3°  be  true  and  H  and  H0  .be  defined  as  in  (2.1)  and 
(3.4)  respectively.  Then  if 

(3.7.  1)  ft  3  *  .  . 

we  have 


(3.8) 


P[H(co6)>  e]  <  P[6  >  H0  (  0£ )] 


with  the  right  hand  side  of  (3.  8)  being  a  distribution-free  bound  for 
every  (F,  G)e  £  . 

Proof.  It  is  clear  by  (3.  1)  with  probability  one  that  H  is  monotone ‘increasing 
on  ft  =  <£  (always  $  D  ft).  We  have  by  the  positivity  of  the  sample  space  that 
H  is  strictly  monotone  on  ft  and  thus  we  have 


(3.  8.  1) 


[H(  w  )  >  e]  =  [oT  >  w  ]  C  [6  >  H  (co  )] 


-1 


because  H  (oor)  <  6  .  Now  since  voo  \  =0  by  (3.7.  1)  we  have 

6  —  « 

Hn(6  )  =  H(co  )  that  i?0  (9  )  has  a  distribution  independent  of  %  x  is 
e  e  e 

clear  by  3°  . 

•  • 

Remark:  If  K  is  continuous  a.  s.  then  we  obtain  equality  between  the  probabilities 

•  • 

of  equation  (3.8).  _  . 

•  * 

•  •  • 
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Remark:  An  example  of  a  situation  when  the  assumption  3  ®  is  satisfied 
occyjrs  when  J2  is  a  semi-group, with  respect  to  composition,  of  homeomorphisms 
on  36  and  r  and  A  are  subsets  of  J2.  We  exhibit  a  trivial  instance  of 

*  this  kind  in  the  next  section.  ’  • 

*  • 

9 

We  now  state  .  * 

•  • 

Theorem  2:  Let  ’.l*  ,  2°  ,  3°  be  true  and  H  and  H0  be  defined  as  in  (2.1)  and 
•  (3.  4)  respectively.  Then  if 


(3.9)  for  each  (y,  \)«rx A  we  have  w  >  ycoX.  for  all 

•  then  we  have  •  . 

•  *  * 

•  a 

(3.10)  .  P[H(w6)  >.£]  <  p[6  >  H0(ee)] 

.  with  the  right-hand  side  being  a  distribution-free  bound  for  every 

(F,G)e'3x£ 

Proof.  -It  is  clear  by  (3.  1)  that  both  H  and  H  are  monotone  on  S2  .  Thus 
we  have 

•  * 

’[H(wJ’>€]  =  [«  ‘  >  o  ]C  [6  >  H(o)  )]  . 

•  •  o  o  e  *  £ 

*  -  -1 

Now  by  (3.9)  we  have  vu  \  <  to  and  hence 

e  —  e 

£  =  H(w  )  =  H  (vu  \_1)  <  H  (w  ) 

€  0  €  ““  0  € 

but  £  =  H  (0)  <:  H  ( w  )  and  since  0  ,  u  en  we  have  0  <  w  but  then 

*a.s.  H(w,)  >  H(0  )  and  it  follows  that 

t  €  •  • 

,  •  • 

[6>H(we)]C’  [6>H(0e)] 

and  the  theorem  is  proved.  '  ... 
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Remark:  A  useful  condition  which  implies  assumption  (3.9)  of  theorem  2 
above,  that  u>\  >  y<o  ,  is 


(3.  10.  1) 


Y  <  m\  and  either  X.u><  wX.  or  yu  <  uy  . 


In  most  instances  this  means  only  that  one  r.  v.  is  stochastically  smaller  than 

* 

the  other  and  that  one  of  the  r.  v.  's  has  a  distribution  which  satisfies  a  convexity 

t* 

type  condition. 

Let  us  make  the  additional  assumption 

4°  y  t  ~  are  1  -  1  transformations  in  r*  and  A  respectively.  We 
•  •  • 

can  in  this  case  define  * 


(3.  11). 


A  r  ^  —  1  <*>*/  —  1 

H  (<())  =  J  Fy  4>  dG  X. 


We  remark  that  by  3°  being  satisfied,  H‘(<|>)  has ‘a  distribution* which  depends 

0  • 

only  on  $  e  $  .  Let 


(3.  12) 


$a  =  {«{>€«*:  P[Ho(4>)  >  e]  =  a} 


.  For  a«  (0,  1),  &  is  not  empty  whenever  there  exists  0  e£2  such  that 
.  a  .a 

P[H  (0  ) •>  e]  =  a  !* 

•  o  a  — 

•  *  • 

We  now  state  without  proof: 

Theorem  3:  Let  1°  ,  2°,  3°,  4°.,  be’truewith  H  and  Hq  as  defined  as  in  (2.1) 

•  • 

•  • 

and  (3.  11)  respectively  then  if 

•  • 

(3.  13)  for  each  cj>€  *S>  there  exists  a  uhique  largest  element  in*  £2  ,  say 
<(>  ,  such  that  t*)  <  <t>  . 

*  a*  •’  «  iZ 
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(3.14) 


Then  writing  w  =  (v  <f>  \)  for  4>  e  $ 

£2  a 


P[H(w  )  >  e]  <  P[H  (<)>)>6]  =  a 
0 


and  the  right  hand  side  of  ( 3. 14)  is  distribution-free. 


Now  we  define 


(3.15) 


to  =  sup  (  y  <()  X.  )£2 
*  +‘*5 


as  the  element  to  be  used  in  our  estimate. 

Remark:  In  case  there  exist  a  4>  such  that  v  *  d>  \  =  w  e  £2  we  obtain 

a  a  a 

equality  between  the  probabilities  in  equatipn  (J.  14). 


4.  Examples 

Let  us  first  give  an  application  of  theorem  1  and  take  X  =  (0  ,00)  f 

•  « 
£2  =  {w  :  w(x)  =  tax,  w  >  0}  ,  i.e.,  here  we  take  the  transformations  to  to  be 

scalar  multiplication  by  a  positive  constant  to  which  we  give  the  same  designation. 

Let  the  orderings  on  £2  and  36  be  the  same  and  be  the  usual  ordering 

•  • 
on  the  real  line.  Let  F0  and  G0  be  distributions  on  36  and  take 

-  {G0«  :  w«£2}  H  ^  =  (F0w  :  we £2}  .  Without  loss  of  generality  we  may 

assume  that  EX  =  1/y  and  EY  =  1/X.  .  Now  we  define  F^(x)  =  F0(x/X)  , 

G(x)  =  G0(V?)  and  these  two  estimates  are  ample  for  %  and  jtf  ,  respectively. 

The  assumptions  of  theorem  one  are  seen  to  be  satisfied  and  0  is  the 

unique  element  of  £2  such,  that 

00 

f  F o  (0x)  dGo(x)  =  e 

’  0 


(4.  1) 
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but  then  0^  =  vco€  ^  so  a*  s*  we  have  by  the  corollary 


(4.  2) 


°o  0  x  • 

H(«  )  =  H  (0  )  =  /  F  (— — )  dG  (-*+) 
«  o  «  o  'vX  o  \y 


which  must  needs  be.tabulated  at  points  of  interest. 

The  integrations  (4.  1)  and  (4.2)  might  be  difficult  to  perform  and 

could  require  numerical  integration  techniques  depending  upon  F0  and  G0 

•  * 

In  order  to  continue,  let  us  make  the  mathematically  convenient  assumpti.on 

•  • 
that  the  densities  exist  and  are  given  by 


f  (x)  = 


x(k/2)-le-x/2 

r(k/2)2k//2 


,  g  (x)  =  e 
•  0 


where  k  is  a  known  parameter  of  the  chi-square  distribution. 

•  ^  • 

Thus  from  (4.  1)  we  obtain 


00  oo  .  * 

/  F  *(9x)e'Xdx  =  /  e~y/6f  ('y)dy  =  (1  +  2/9)"k/2 

o  o  •  0  • 

-2/k 

and  0£  =  2/(«  -  1),  but  fortunately  this  integration  solves  (4.2)  and  we. 


HQ(0e)  =  (1  +  ZyX/Q  \Y)"k/2 


from  which  the  probability  distribution  can  be  found  by  a  simple  transformation, 

using  tables  of  the  F  distribution,  since  yX  has  a  chi-square  distribution 

with  kn  degrees  of  freedom  and  2  \Y  has  a  chi-square  distribution  with  2m 

degrees  of  freedom  where  n  and  m  are  the  sample  sizes  used  in  computing 
• 

X  and  tY,  respectively. 
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5.  Ample  Estimates  of  Distributions 

In  this  note  we  restrict  ourselves  to  estimates  F  of  F  which  are 

•  n 

•  •  • 

ample.  This  property  could  be  of  some  importance  in  other  applications  since 
in  particular  F^  ample  for  %  tells  us  that  the  analogue  of  the  statistic, 

say/ 

D=  sup  |f(x)  -  F(x)  |  =  sup  Iff  1  (y)  —  y | 

•Xe  3£  ye(0,  1) 

is  distribution-free -with  respect  to 

.Ampleness  of  an  estimate  of  a  distribution  F  is  a  strong  property  since, 

•  •* 

in  many  instances,  it  allows  us  to  place  confidence  contours  along  the  entire 
*  .  • 

•  • 

distribution  function.  Suppose  F  is  ample  for  ',  a  set  of  distributions  on 
the  real  line,  and  r  is  some  distribution  function  on  the  unit  interval  which 

*  <*  •  /—w'  —  X  • 

is  in  the  range  of  F  F  and  for  some  constant  (3  we  have 

. 

P  =  P  [  F_1  <  4]  >  P-U^F^F] 

*  •  .  i 

*  • 

where  t,  (t)  =  inf  {jc  :  £,(x)  >  t}  then  £,  F  provides  a  lower  contour  for  F 

•  »  ,  t  *  • 

with  p  confidence. 

•  •  #  . 

We  have  ampleness  within  those  families  of  distributions  on  the  real 
*  • 

line  for  which  one  can  estimate  percentiles  and  bognds  on  those  estimates  from 

•  •  •  • 
observations  in  the  region  of  central  tendency  such  as  normal,  log-normal, 

exponential,  etc.  But  we  may  have  ampleness  for  distributions  on  spaces  of 

higher  dimension,  as  the  following  two  examples  show.  * 

Let  JO  be  the  set  of  ao  ritinuous  distributions  on  any  partially  ordered 

set  (X,<)  as  defined  in  section  2  . 


For  y  e  36  define  c(*  ,  y)  to  be  the 
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indicator  (characteristic)  function  of  the  set  [Y  -<  y]  .  Then 


(5.1) 


l  m 

G(x)  =  “  ^  c(x>Yj) 


is  an  ample  estimate  of  the  distribution  G  e  from  the  sample  (Y,, . . . ,  Y  ) 

l'  7  n 

of  independent  observations  each  with  distribution  G.  We  shall  call  G  the 


empiric  cumulative. 

Let  36  be  Euclidean  p-space,  with  -4.  the  usual  ordering,  the 

elements  of  which  we  shall  take  to  be  column  vectors.  Let  X  be  rcfc,  2)  and 

let  D  be  the  non- singular  matrix  such  that  DSD'  =1.  We  shall  say  here,  for 

brevity,  that  D  diagonalizes  S.  Then  we  can  write  F(x)  =  Fo[D(x  -  p)]  for 

•  • 

x €  36  ,  where  •  Fq  is  the  distribution  of  a  7*t(0,l)  variate. 

Let  p  and  S  be  the  usual  maximum  likelihood  estimates  taken  from 
a  sample  of  size  n  >  p  and  define  D  as  the  diagonalizing  matrix  for  2  .  It 
exists  a.  s.  Now  if  we  define 


(5.2)  *  cr(x)  =  D(x  -  p) 

and  the  corresponding  definition  of  cr  then  fix  y  e  Ik-  and  set 


(5.3)  T  =  £cr  \y)  =  DD  2[y  -  D(p  -  p)]  , 

that  this  has  a  distribution  which  is  the  same  for  all  p  ,  2"  is  sufficient  for 

ampleness*.  But  clearly  D(p  -  p)  is  0^.(0,  —  I)  so  it  is  sufficient  that  the 

n 

vector  in  square  brackets  in  (5.  3)  satisfies  the  condition. 

*  -1 

Set  B  =  DD  .  We  show  that  this  has  a  distribution  independent  of 
p  ,  2  also.  From  (5.  2)  we  have 
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(5.  4)  BD  £  D'B'  =  I  . 

n-1 

But  nDSD'  is  distributed  as  ^  Z,Z‘ ,  where  Z.  is  ^(0,1)  ‘independent 

i=l  1  1  . 

of  Z.(i  #  j)  . 

Hence,  except  for  zero  probability;  B  diagonalizes  a  random  matrix 

which  has  a  distribution  independent  of  p.,  2  .  Note  that  T'T  is,  except  for 

2  *  .  • 

a  scale  factor,  Hotelling's  T  with  non-centrality  factor  y'y/n  . 

•  •  • 

♦ 

.  • 

6.  Some  Remarks  on  Properties  of  the  Estimate  H 

Let  us  set*  F  =  EF  with  similar  meaning  for  G  and  TT.  'By  applying 

*  • 

the  Fubini  theorem  and  integrating  by  parts  we  have  for  co 

•  H(w)  =  J  To  d(T 

and 

•• 

H(w)  -  H(w)  =  /  Fu  d(G  -  G)  +  /  (Fw  -  Fco)  cfG  . 

Remark  6.  1:  H  is  an  unbiased  estimate  of  H  on  S2.  if  F  and  G  are  unbiased 

estimates  of  F  and  G,  respectively. 

The  converse  is  not  true  without  additional  assumptions.  For  the  real 

line,  unbiasedness  of  H  on  the  entire  set  of  1-1  order  preserving  maps  of 

onto  itself  is  sufficient  for  unbiasedness  of  F  and  G  when  one  imposes  the 

conditions  that  F  and  G  are  continuous. 

We  say  that  F  is  consistent  for  F  whenever 
n 

sap  I  Fn  -  F  |  -  0  , 

where,  as  before,  the  subscript  n  refers  to  the  sample  size  used  to  obtain 
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the  estimate.  We  shall  refer  to  consistency  exclusively  in  this  sense. 
Now 

H(cj)  -  H(o)  =  /  (G  -  G)  dFo  +  /  (Fw  -  Fw)  dG  . 

* 

Thus  it  follows  that 


|h(o)  -  H(o)  |  <  sup  I?'—  F|  +  sup  |G  r  g|  . 

~  X  X 

Remark  6.2:  If  and  G  are  consistent  for  F  and  G,  respectively,  then 
n  m  77 

H*  is  consistent  for  H  uniformly  on  £2  as  l/n  +  1/m  -*■  0  . 
mn  •  .  • 

Remark  6.  3:  If  F  and  G  are  the  empiric  cumulatives,  then  H  ,  for 
n  m  ’  .  mn  ’ 

each  g5€J2  ,  is  consistent,  unbiased  and  asymptotically  normal  with  l/n  +  1/m  -*•  0 
This  follows  from  the  known  behavior  of  U-Statistics. 

*  *  ro  •  ^ 

If  we  assume  F  is  consistent  and  G  is  the  empiric  cumulative,  we 
should  obtain  asymptotic  normality  if  f'  "  stabilizes"  rapidly  enough.  We  do  not 
attempt  to  find  the  weakest  such  conditions,  but  we  have 

Remark  6.  4:  If  G  is  the  empiric  cumulative,  then  the  function  H.  defined  by 


m 


H~»  =  m  Z  r„(“YJ> 


mn 


is  a  consistent  estimator  of  H  "and  asymptotically  normal  if  l/n  +  1/m  — 

in  such  a  way  that  \ I  m"  sup  |f^  -  f|  -♦  0  . 

n 

l  m 

Proof:  Let  Z  =  ~i=s  X  [F(wY.)  -  H(w)l  and  define  t,  by  the  equation 
m  v  m  ,  ^  j  m,  n 


_  j 

\Trr?  [H  («)  -  H(w)l  =  Z  +  r, 
mn  nr 


.  That.  Z  is  asymptotically  normal  is  clear, 
m  7 


and  the  result  follows  if  t, 

•'r 


0  ,  which  is  guaranteed  by  the  hypothesis. 
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7.  An  application  to  Reliability  Theory 

Let  Ik  ~  (0,  <»)  and  let  Y(w)  be  the  demand  time  for  a  particular 
equipment  in  a  specified  environment  during  an  inspection  period  of  length  co  . 
Let  X  be  the  life  length  of  this  equipment  under  continued  use  in  that 
environment. 

We  want  the  maximum  u  such  that  for  given  «  >  0 
P[X  <  Y(o>)]  <  e  . 

We  particularize  by  making  the  assumption  that  Y(w)  denotes  multiplication  and 
without  loss  of  generality  we  suppose  that  a  scaling  factor  has  been  introduced 
so  that  ooe  (0,  1)  .  Now 

H(cj)  =  P[X  <  toY]  =  /  F(coy)dG(y) 

where  we  assume  that  X  and  Y  are  independent  r.  v.  s.  with  continuous 
distributions  F  and  G  respectively. 

Now  if  F  is  a  distribution  with  derivative  f,  then  the  function  y‘ 
defined  by 

(7.1)  y' (x)  =  x for  *>0 

is  called  the  failure  rate  (or  hazard  rate). 

A  common  and  intuitively  appealing  assumption  concerning  life  length 
distributions  is 

A°  the  failure  rate  of  X  exists  and  is  non-decreasing. 

We  also  assume 

B°  X  is  stochastically  larger  than  Y,  i.e.,  F<G  . 
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But,  further,  without  loss  of  generality  we  assume 


(7.2) 


—  >  e  >  0  is  given  and  P[X  <  Y]  >  e 


Let 


x 


y(x)  =  f  y1  (t)  dt  for  x  >  0 
0 


Then  if  we  set 


F o  (x)  =  G0(x)  =  1  -  e  X  ,  x  >  0  , 


from  (7. 1)  we  see  that  G(x)  =  G0  k(x)  . 

It  is  clear  that  B°  implies  y  <  X  and  we  now  show  that  A°  implies 
yco  <  coy  and  thus  (3.  10.  1)  implies  assumption  (3.9)  of  theorem  2  .  To  see 
this,  note' that  y  must  be  .convex  and  hence  y(wx)  <  coy(x)  for  co  e  (0,  1) 
since  y(0)  =  0.  This  is  precisely  our  condition. 

We  now  seek  the  unique  0  such  that 
.  00 

(7.  3)  '  /  F o  (0x)  dG0(x)  =  e  . 

.  .  0 

This  may  be  easily  integrated  directly  or  by  comparison  with  (4.  1)  seen  to  be 


We  now  choose  F,  G  to  be  the  empiric  cumulatives  as  defined  in  (5.  1) 
(which  are  ample).  Hence  wfe  need  tabulations  of  the  statistic 

—  i  A  i  i  n  m 

H0(e)  =  f  F  F~  6  dG  G_  = —  J)  y  c(0U.  ,  V.) 

*  ,  J  mn  i  =  i  j“i  J  1 

where  F(Xi>  =  ,  i  =  1, . . . ,  n,  and  G(Y^)  =  for  j  =  1, .  . . ,  m  are  all 

independent  r.v. 's  uniform  on  (0,  1). 
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Consider  the  events 


\  "  ^V(k)  <  6  <  V(k+1)^  »  k  -  0,  .  . . ,  n  , 


where  V .......  ,V ,  .  are  the  ordered  observations  of  V  with  V,  ,  =  0  . 

(1)’  *  (n)  (o)  * 

V(n+i)  =  1  •  These  events  form  a  disjunct  partition  of  the  sample  space. 

It  is  clear  that  H0(l)  =  U  is  the  well-known  Mann-Whitney 

m,  n 

statistic  based  on  samples  of  size  m  and  n.  Now  let  S  (9)  =  mnH0 (0)  , 

mn 

and  it  follows  that 

S  (0)  |a.  =U  ,  for  k  =  0,  l,...,n  . 
mn  k  mk  *  *  * 

With  the  convention  that  U  =0  with  probability  one,  we  obtain 

mo 


p[smn<e)  <  t]  =  £  P[smn(e)  <  tlA^pt^] 


k=0 


mn 


'kJ*  l"kJ 


n 


=  X  b<k;  n,0)p[uk  <  t]  ,  t  =  o,i,. 

•  k=0 


We  know  that  ES  (9)  =  ■“ p~0  and  since  we  have 
mn  2 


ETJ2  =  ~r  ,(n  4  m  +  1)  +  Cnm/2)  , 
mn  12 


it  follows  by  (7.4)  directly  that 


n 

Z 

k=0 


ES2  (0)  =  f.  b(k:  n,  0)EU2, 
mn  <-*  ’  mk 


=  +  (  -a  +  ^)[„(„  -  lie2  +  ne] 


Hence  we  obtain  the  variance  of  S  (0)  as 

mn 


..  mn0r,  3m0  .  (n-l)0  . 

Var(S- (0))  -  —  [2m  -  -j-  4  N  -  L  4  1] 


mn 


,mn  , 


6 
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In  this  case,  for  example,  if  we  chose  6=0  we  would  find  that 


min 
max  Yj 


which  is  intuitively  appealing 


-22- 
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